MERT-v1-330M is an advanced music understanding model trained based on the MLM paradigm, with a parameter scale of 330M, supporting 24K Hz audio sample rate, and suitable for various music information retrieval tasks.
Audio Classification
Transformers